Compressive statistical learning with random feature moments

Abstract

We describe a general framework, compressive statistical learning, for resource-efficient large-scale learning: the training collection is compressed in one pass into a low-dimensional sketch (a vector of random empirical generalized moments) that captures the information relevant to the considered learning task. A near-minimizer of the risk is computed from the sketch through the solution of a nonlinear least squares problem. We investigate sufficient sketch sizes to control the generalization error of this procedure. The framework is illustrated on compressive PCA, compressive clustering, and compressive Gaussian mixture modeling with fixed known variance. The latter two are further developed in a companion paper.
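The one-pass compression step described in the abstract can be illustrated with a minimal sketch. It assumes random Fourier features as the generalized moments, which is one possible choice and not something the abstract specifies. The names `draw_frequencies` and `sketch_one_pass`, the Gaussian frequency distribution, and the `scale` parameter are hypothetical. The learning step (recovering a near-minimizer from the sketch by nonlinear least squares) is not shown.

```python
import cmath
import random


def draw_frequencies(dim, sketch_size, scale=1.0, seed=0):
    """Draw `sketch_size` random frequency vectors in R^dim.

    Assumption: i.i.d. Gaussian frequencies; the abstract does not
    say which distribution is used.
    """
    rng = random.Random(seed)
    return [[rng.gauss(0.0, scale) for _ in range(dim)]
            for _ in range(sketch_size)]


def sketch_one_pass(samples, freqs):
    """Average exp(i <w_j, x>) over all samples in a single pass.

    Only the running sums and a counter are kept in memory, so the
    training collection can be streamed and then discarded.
    """
    totals = [0j] * len(freqs)
    count = 0
    for x in samples:
        for j, w in enumerate(freqs):
            phase = sum(wi * xi for wi, xi in zip(w, x))
            totals[j] += cmath.exp(1j * phase)
        count += 1
    if count == 0:
        raise ValueError("cannot sketch an empty collection")
    return [t / count for t in totals]


# Illustrative usage on synthetic data (1000 points in R^3).
data_rng = random.Random(1)
data = [[data_rng.gauss(0.0, 1.0) for _ in range(3)] for _ in range(1000)]
freqs = draw_frequencies(dim=3, sketch_size=20)
sketch = sketch_one_pass(data, freqs)
```

Because the sketch is an empirical average, sketches computed on disjoint parts of the collection can be merged by a weighted average, which is what makes a streaming or distributed computation possible.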

Similar resources

Compressive Statistical Learning with Random Feature Moments

We describe a general framework, compressive statistical learning, for resource-efficient large-scale learning: the training collection is compressed in one pass into a low-dimensional sketch (a vector of random empirical generalized moments) that captures the information relevant to the considered learning task. A near-minimizer of the risk is computed from the sketch through the solution of a ...

Compressive Feature Learning

This paper addresses the problem of unsupervised feature learning for text data. Our method is grounded in the principle of minimum description length and uses a dictionary-based compression scheme to extract a succinct feature set. Specifically, our method finds a set of word k-grams that minimizes the cost of reconstructing the text losslessly. We formulate document compression as a binary op...

Compressive Reinforcement Learning with Oblique Random Projections

Compressive sensing has been rapidly growing as a non-adaptive dimensionality reduction framework, wherein high-dimensional data is projected onto a randomly generated subspace. In this paper we explore a paradigm called compressive reinforcement learning, where approximately optimal policies are computed in a low-dimensional subspace generated from a high-dimensional feature space through rando...

Small Statistical Models by Random Feature Mixing

The application of statistical NLP systems to resource constrained devices is limited by the need to maintain parameters for a large number of features and an alphabet mapping features to parameters. We introduce random feature mixing to eliminate alphabet storage and reduce the number of parameters without severely impacting model performance.

Statistical moments of the random linear transport equation

This paper deals with a numerical scheme to approximate the mth moment of the solution of the one-dimensional random linear transport equation. The initial condition is assumed to be a random function and the transport velocity is a random variable. The scheme is based on local Riemann problem solutions and Godunov’s method. We show that the scheme is stable and consistent with an advective-dif...

Journal

Journal title: Mathematical Statistics and Learning

Year: 2021

ISSN: 2520-2316, 2520-2324

DOI: https://doi.org/10.4171/msl/20